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Abstract —This paper addresses a coordination problem be¬ 
tween two agents (Agents 1 and 2) in the presence of a noisy 
communication channel which depends on an external system 
state The channel takes as inputs both agents’ actions, 

and {x 2 ,t} and produces outputs that are observed strictly 
causally at Agent 2 but not at Agent 1. The system state is 
available either causally or non-causally at Agent 1 but unknown 
at Agent 2._Necessary and sufficient conditions on a joint 
distribution Q{x 0 , 2 : 1 , £ 2 ) to be implementable asymptotically 
(i.e, when the number of taken actions grows large) are provided 
for both causal and non-causal state information at Agent 1. 

Since the coordination degree between the agents’ actions, xi.t 
and X 2 ,t, and the system state xo,t is measured in terms of an 
average payoff function, feasible payoffs are fully characterized 
by implementable joint distributions. In this sense, our results 
allow us e.g., to derive the performance of optimal power control 
policies on an interference channel and to assess the gain provided 
by non-causal knowledge of the system state at Agent 1. 

The derived proofs readily yield new results also for the 
problem of state-communication under a causality constraint at 
the decoder. 


I. Introduction 

Performance characterizations of general distributed net¬ 
works with agents that observe the system state and the actions 
of some of the other agents, is a prominent open problem 
also studied by related disciplines such as control JT] and 
game theory (2). In this paper, we contribute to the solution 
of a special case of this general problem, by treating it as 
a coordination problem that can be solved using joint-source 
channel codes. This approach has recently been proposed in 
CD, see also fQ, and is expected to extend to setups with more 
than two agents and to different observation structures. 

The technical setup under investigation is as follows. We 
consider two agents that select their actions repeatedly over 
T > 1 stages (or time-slots) and that wish to coordinate via 
their actions in the presence of a random system state. At 
each stage t £ {1,. .., T}, the action of Agent k £ {1, 2} is 
%k,t £ Tfe, with \Xk\ < 00 , and the realization of the random 
system state is xo,t £ Xq with \Xq\ < 00 . The state sequence 
Xq 1 ,... ,Xq t is given by nature and its components are 
independent and identically distributed (i.i.d.) according to a 
distribution p 0 . 

Suppose that each agent has an individual payoff function 
Wfe: Xq x X\ x X 2 —> R, for k £ {1, 2}, that is affected by 
both agents’ actions and the nature state. We are interested in 


determining the set of feasible expected average payoffs 



= E 


1 

T 


T 

Uk(x o,t, X\ tt , X2 ,t) 

t—1 


*£{ 1 , 2 }, ( 1 ) 


that are reachable by some strategies for the agents. This set 
of feasible expected average payoffs is fully characterized 
by the set of feasible averaged distributions on the triples 
{(X 0it , X ltt , X 2 ,t)}J=i- In fact, denoting by Px 0 , t x l<t x 3 , t the 
joint distribution of the time-f tuple (Xgp, Xi >t , X 2 ,t), we have 


4 T) = ^J2nMx 0 , t ,x M ,x 2 .*)] 

t= 1 

1 T 

= ^2 u} k{xo,X 1 ,X 2 )-J2 P Xo,tX lit X 2 , t ( x O,Xl,X 2 ). 

Xq€lXq t =1 

X 1 GX 1 
X 2 GX 2 

Our main goal in this pajjer is to determine the set of 
averaged distributions ^ J2t=i p x 0 ,tX ltt x 2l t ( x o, x i, x 2 ) that 
can be induced by the agents’ strategies. For simplicity, and 
in order to obtain closed form expressions, we shall focus on 
the limit T —> 00 . 

We consider two kinds of scenarios with two different 
observation structures. In the first scenario—referred to as 
non-causal coding —Agent 1 observes the system states non- 
causally. That means, at each stage t £ {1,... ,T} it knows 
the entire state sequence Xq = (Xo,i, ..., ATo,r)- In the 
second scenario—referred to as causal coding —Agent 1 
learns the states only causally. Thus, here, at each stage 
t £ {1,..., T}, Agent 1 only knows Xq. 

In both scenarios. Agent 2 has no direct access to the state 
nor to Agent l’s actions. Instead, after each stage t. Agent 2 
observes the output y t £ y, with |T| < 00 , of a discrete 
memoryless multi-access channel that takes as inputs the two 
agents’ actions and the system state. The multi-access channel 
is assumed memoryless and of transition law F: 

Pr \Xt = Utl^Q = Xq, X\ = x\, X2 = x 2 , T = y ] 

= r(yt|x 0 ,t,xi,t,a;2,t), (2) 


where throughout this paper we use the shorthand notations 
A m and a m for the tuples (Ai,..., A rn ) and (ai,..., a m ), 
when to is a positive integer. 




The scenario with non-causal coding was introduced in 
0, 0. Special cases, had previously been considered in 
0 , Pfl , ||7l , liFI . Most prominently, Gossner et al 0 solved 
the first instance of our problem. They considered the case 
where Agent 2 can observe strictly causally the system states 
and Agent 1’ actions, which corresponds in our setup to 
(0 describing the channel y t = (xtxt- x i,t)- Cuff and Zhao 
presented an alternative proof 0 of the results in j4j based 
on more traditional information-theoretic tools and under the 
framework of “coordination via actions”. The noisy commu¬ 
nication channel was introduced by Larrousse et al. in 0; 
their channel however did not depend on the system state nor 
on Agent 2’s actions. This same special case has also been 
addressed by Le Treust in 0. 

In the present work, we provide a converse proof for 
the general scenario with non-causal coding that establishes 
optimality of a scheme proposed in 0- We also solve the 
scenario with causal coding. Our result shows that in this 
case the agents’ optimal strategies are simple and ignore all 
communication over the channel. 

We exemplify our findings at hand of a power control 
problem. In particular, for this problem, we quantify the loss 
incurred in performance when the coding has to be performed 
causally instead of non-causally. 

At last, the problem under investigation is linked to the 
information-theoretic state-communication problem 0 . m. 
In fact, the proof techniques derived for the coordination prob¬ 
lems, immediately yield new results on state-communication 
when the decoder is restricted to be causal or strictly causal. 


II. Problem formulation and main results 


As explained previously, the distributed network or system 
comprises two agents. These agents take actions in a repeated 
manner according to their strategies. Strategies are sequences 
of functions defined by: 


case NC: 
case C: 
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where NC (resp. C) stands for non-causal (resp. causal) 
coding and for c £ {C,NC}, the functions (of)i<t<T (resp. 
( T~t)i<t<T ) describe the strategies employed by Agent 1 
(resp. 2). The main problem is to characterize the set of joint 
probability distributions that can be reached when T —> oo 
which we call the set of implementable distributions according 
to the terminology of 0, 0- Specifically: 


Definition 1 (Implementability). For c £ {C,NC}, the 
probability distribution Q{x o, £ 1 , 2 : 2 ) is implementable if for 
every e > 0 and every sufficiently large T, there exists a pair 
of strategies {of ., r t )i<t<T inducing at each stage t a joint 
distribution 


Xo i tX 2i tX 2f t i x 0 , x !, X 2 ) P X\ t X 2 t\Xo t {xi > x 2 |-To )Po (to) 

(5) 


such that for all (xq, Xi, X2) £ Xq x X\ X TV 


2 • _ 

~rp ^ ^ ^ Xq^Xi^X 2] t { x 0 , Xi , X 2 ') Q{ x 0 , X \, xf) 

t=1 


<e. (6) 


We now characterize the set of implementable probability 
distributions both for causal and non-causal coding. 


Theorem 1 (Non-causal coding). Let c = NC. 
Consider a joint probability distribution Q such that 
x 2 Q( x 0t x h £ 2 ) = po{xo). The distribution Q is 

implementable if and only if it satisfies the following 
conditioi {] 


Iq{ A'o; X 2 ) < Iq{V ; Y\X 2 ) - I Q (V; X 0 \X 2 ) (7) 

with Q(xo,xi,x 2 ,y,v) = Q{xo,xi,x 2 )T{y\x 0 ,xi,X 2 ) 
x Pv\x 0 x 1 x 2 ( v \ x o, x i, x 2 ), and V being an auxiliary ran¬ 
dom variable which alphabet cardinality can be restricted as 

|V| < |Tfl| • 1-Til • \X 2 \. 

Proof: See Section [V] ■ 

Theorem 2 (Causal coding). Let c = C. The set of iniple- 
mentable distributions is given by the set of distributions under 
the form 

Q{x 0 ,x x,x 2 ) =P Xl \x 0 x 2 { x i\ x o, x 2 )Px 2 { x 2 )po{ x o)- (8) 

Proof: See Section [VT] ■ 

Note that no information constraint appears in this second 
theorem. This is related to the fact that in the case of causal 
coding no benefit can be obtained by communicating over the 
channel: Agent 2 can simply ignore the channel outputs. In 
particular, when the two agents are interested in maximizing a 
common payoff function w{xo,x\,x 2 ) possible strategies are 
as follows: Agent 2 chooses a constant action X 2 ,t = x 2 , and 
for each stage t Agent 1 picks an action x\ t £ X\ in function 
of this x 2 and of the nature state xqj so as to maximize the 
payoff function on the current stage: 

x\ t £ arg max w{x 0 ,t, ii, £ 2 )- (9) 

s 1 ex 1 

This strategy is referred to as a semi-coordinated policy in the 
context of coded power control introduced in 0. 


III. Application: Power Control 

We exemplify the above two theorems at hand of a 
power control problem. In particular, we wish to illustrate 
the loss in performance incurred when the coding is only 
causal instead of non-causal. An interference channel with 
two transmitter-receiver pairs is considered. Transmissions are 
assumed to be time-slotted and synchronized. For k £ {1,2} 
and “j = (—k stands for the transmitter other than 

k), the signal-to-interference plus noise ratio (SINR) at Re¬ 
ceiver k at a given stage writes as SINR^ = a 2 + k g x ^ x 
where Xk £ %k = {0, -Pmax} is the power level chosen 
by Agent or Transmitter k, gkj represents the channel gain 


The notation Iq (A : B) indicates that the mutual information should be 
computed with respect to the probability distribution Q. 






of link kj, and a 2 the noise variance. We assume that: 
gkj G {Smin,3max} is Bemouilli distributed g kj ~ B{p kj ) 
with P (g k j = 3 m in) = Pkj ; the global channel state is thus 
given by a; 0 = (311,312,321,322). We define SNR(dB) = 
101og 10 and set g min = 0.1, 3 max = 2 , a 2 = 1, 
and {pn,pi 2 ,P 2 i,P 22 ) = (0.5,0.1, 0.1, 0.5). The considered 
common payoff function is w(xq, x\, X 2 ) = Efc=il°g 2 (l + 
SINRfc) and the signal Y observed by Agent/Transmitter 2 is 
assumed to be the output of a binary symmetric channel with 
transition probability e = 0.05. 

Rg. m represents the maximum expected sum-rate against 
SNR in dB for our two scenarios with causal and non-causal 
coding at Transmitter/Agent 1. (For practical reasons we re¬ 
strict to | V | = 10.) These two scenarios are compared to a sce¬ 
nario with costless communication where Agent/Transmitter 2 
observes xf non-causally and thus the maximum of w can be 
reached at any stage, and to a scenario where the two agents 
don’t coordinate but simply transmit at full power throughout. 



Fig. 1. Expected payoff against SNR(dB). One message from the figure is 
that good coordinated power control policies may perform quite close to the 
maximum sum-payoff for certain standard payoff functions; here the chosen 
sum-payoff is the sum-rate. Another message is that, for certain standard 
payoff functions designing non-causal power control policies may not bring 
very significant performance gains over causal power control policies. 


IV. Related Results on State-Communication 

Consider again the setup of Sections U and [HI but—in line of 
previous works on state-communication 0. 01 —specialize 
the channel in (0 to a state-dependent discrete memoryless 
channel (DMC) with state xq ,t and single input x\ it 11 

F(y t |a:o i t,a:i it ,a;2,t) = ^(yt\x 0 ,t,xi yt )- (10) 


2 Traditionally, in state-communication the channel inputs are denoted 
x\,,xt, the state symbols s\,... ,st and the reconstructed symbols at 
Agent 2 si,..., st- For coherence, here we keep the notations introduced 
in the first part of the paper. 

3 Our results readily extend also to the more general channel in 0. 


Initially, we consider non-causal coding functions and causal 
or strictly causal decoding functions: 


case NC-enc/C-dec: < 
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case NC-enc/SC-dec: < 
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where C (resp. SC) stands for causal (resp. strictly causal) 
decoding. 

The goal here is that Agent 2 can produce a reconstruction 
sequence x\ that matches the state sequence Xq up to an 
allowed distortion. In particular, for d g {C, SC}, Distortion 
D > 0 is said achievable under a given bounded single-letter 
distortion function 

5: A 0 x AW M max ], (13) 


if for every e > 0 and sufficiently large blocklengths T it is 

possible to find encoding and decoding functions 

and such that Agent 2’s reconstructed sequence Xj 


satisfies 


E 


-Y;s(x 0 ,t,x 2i t) 

t -1 


< D + e. 


(14) 


Theorem 3 (Non-causal coding). Let d = C, i.e., decoding is 
causal. Distortion D is achievable if and only if 


E[S(X 0t g(U,Y))]<D, (15) 


for some function g: U X y —> X 2 and some joint law 
QxqXiYUv{xoi Xi,y, u, v) that factorizes as 

Po(xo)P UVXl \x 0 (u, V, Xx |tco)T(j/|a:o, £ 1 ); (16) 

and satisfies 


Iq{U-X o) < I q (V;Y\U) - Iq{V ; X Q \U). (17) 

The input Xx can be restricted to be a function of (U, V, Xq). 

Proof: Omitted for brevity. ■ 

Remark 1. Theorem [T] remains valid for d = SC, i.e., when 
decoding is strictly causal, if O is replaced by 

E[S(X 0 ,g(Y))]<D. (18) 

In this case, one can restrict to the choice U = g(Y) = A' 2 . 

Assume now that encoding is causal. The setup is as 
described above, except that the encoding functions in (fTTb 
have to be replaced by functions of the form 

a t c : Xl -> Xx- (19) 

Theorem 4 (Causal coding). Let d = C, i.e., decoding is 
causal. Distortion D is achievable if and only if 

E[S(X 0 ,g(Y))]<D, (20) 


for some function g: y —> X% and a joint distribution 
Px 0 x 1 y(xo,xx,y) that factorizes as 

Po{xq)P Xi \x 0 (xx \xq)Py\x 0 x 1 {y\xo,xx). (21) 












Let d = SC, i.e., decoding is strictly causal. Distortion D is 
achievable if and only if there exists a constant value x 2 G X 2 
so that 

E[8(X 0 ,x 2 )]<D. (22) 

Proof: Omitted for brevity. ■ 


Remark 2. In combination with previous results on state- 
communication EES, our results provide the following 
insights. When the decoder is non-causal, Wyner-Ziv coding 
has to be used to compress the state. This is not possible 
anymore when the decoder is only causal, where standard 
compression suffices. When the encoder is non-causal, then 
Gelfand-Pinsker coding should be used to communicate over 
the channel. When the encoder is only causal, this is not 
possible anymore and the less powerful Shannon strategies 
suffice. When the encoding and the decoding are causal or 
strictly causal, then no coding is needed anymore; simple 
symbol-by symbol strategies at the transmitter (Agent 1) and 
the receiver (Agent 2) are sufficient. 


V. Proof of TheoremQ] 

The proof of TheoremQ]can be divided into three parts: the 
direct part, which is established in 0 and omitted for brevity; 
the bound on the cardinality of the auxiliary alphabet |V| which 
is also omitted; and the converse, which shows optimality of 
the coding scheme in H and is proved in the following. 

Converse. Let Q be an implementable distribution, and fix 
an arbitrary e > 0. 

By definition, there must exist a sufficiently large block- 
length T and strategies {<7^°}^ and {rt}f = i such that for 
t G {1 the tuple (Xo,t, Xij, X 2 ,t) induced by these 

strategies has a joint law Px 0 t Xi t x 2 t that satisfies 


1 

T 


T 

^ ^ Xi'tX 2 ,t ('t'O; , x 2 ) Q(x 0 , x ±, x 2 ) 

t=l 


<e. (23) 


For each t, let Y t denote the output of the channel for state 
X o t and inputs X\ f and X 2 j,. For each positive integer m, 
we use the shorthand notation Af n to denote the random tuple 
(A m ...., At). Let Z be a random variable that is uniformly 
distributed over {1,..., T} independent of X f, X f, X'f, Y T , 
and define for each t. 


V t ±(Xl t+1 ,Y t ~ 1 ). (24) 

Finally, let V 4 (y z , Z), X 1 4 x 1>z , Y = Y z , X 0 = X 0 , z , 
and X 2 = X 2 ,z, and denote the probability distribution of the 
tuple ( V , X 0 ,Xi,X 2 , Y) by Qvx 0 x 1 x 2 y- Notice that this law 
factorizes as 


QvxoX 1 x 2 Y(v,x 0 ,x 1 ,x 2 ,y) 

= Po{xo)Qvx 1 x 3 \x 0 {v,x 1 ,X2\xo)r(y\xo,xi,x 2 ), (25) 


with a marginal law satisfying 

E Qvx 0 x 1 x 2 Y(v,xo,xi,x 2 ,y) 

v,y 

1 T 

= Pxo.tx^xi, t ( x o,xi,x 2 ). 

t=i 


The Markov chain Y - (X 0 ,X 1 ,X 2 ) - V in © 
holds because by the memorylessness of the channel (0, 
Y t - (Xo, t ,X ltU X a ,t) - (X^X^X^Y*- 1 ^) forms 
a Markov chain for any t G {1,...,T}, and be¬ 
cause the time-sharing random-variable Z is independent of 

(XZ,XT,XZ,YT). 

We continue with the following sequence of equalities: 

2r(xZ;V T ) 

= = Y HXo,t; r r |A'J, +1 ) 

t= 1 

= ffi'52 I ( x o,C,Y T ,X^ t+1 ) 
t= 1 

= ^E [/(Xo, t ;F‘,X 0 T t+ i) +I(X 0 ,t;Y t T +1 \Y t ,XT t+1 )\ 

t= 1 

t= 1 

+I(X oX ,Y t T +1 \Y t ,XZ t+1 )\ 

1 T 

- t E [ I ( X oAY t ,V t ,X 2:t ) + I (X 0!t -,Y^ 1 \Y t ,V t )], 

t= 1 

= I(X 0 ,z, V z ; Y z \X 2 , z , Z ) + I{X 0 ,z ; Y£ +1 \Y z , V z , Z) 

= I{X 0 , V; Y\X 2 ) + /( X 0 ,z; Y Z+1 \Y Z , V z , Z), (27) 

where (a) follows from the chain rule of mutual information; 

(b) by the i.i.d-ness of the state sequence ... ,Xq t x)j 

(c) by the chain rule of mutual information; (d) because X 2 t 
is computed as a function of F t_1 ; (e) by (l24l >; and the last 
two equalities by the definitions of (Z, V z , Xq,z, X 2 ,z, Yz) 
and (V,Xo,X 2 ,Y) and the independence of Z and Xq z- 

On the other hand, 

\l^l-Y T ) 

= \ E [ J (*o,t; YT \ X lt+i) + HY*-, -Y 0 T t+1 ) 

t= 1 

= E [^ X oY Y 4 ) + I(X 0tt - Y t T +1 1 Y t Xl t+1 ) 

t= 1 

— I(Y t ~ 1 ;X^ t ) 

= E [ 7 ( X o T t ; Yt IF 4 - 1 ) + I(X o it ; Y t T +1 \^X^) 

t= 1 

t= 1 

< ^E + i^yy^xi^) 

t= 1 


( 26 ) 









= \ E [ 7 (*o,t, Vs, Y t \X 2 , t ) + I(X 0it ; Y t T +1 \Y t , V t ) 

t= 1 

= /(X 0 ,z, 14; F Z |X 2 , Z , Z) + /(X 0 ,z; y£ +1 \Yz, V z , Z) 

(m) 

< J(Xo,y;y|X 2 ) + /(X 0i z;lf +1 |yz,Vz,Z), (28) 

where (/) follows from the chain rule of mutual information 
and from the Csiszar-Kramer telescoping identity ED; (g) 
and (h) follows by the chain rule of mutual information; (i) 
because X 2jt is computed as a function of Y t_1 ; (j) follows 
because conditioning cannot increase entropy; (k) by (l24l >; (£) 
by the definitions of (Z, Vz,Xq^z, X 2j z, Yz)\ and (to) by the 
definitions of (V,Xq,X 2 ,Y) and the independence of Z and 
Xo } z- Combining ED and (l28l >. we obtain 

/(Xo; y, V, X 2 ) < /(Xo, V; Y\X 2 ), (29) 

which by chain rule of mutual information is equivalent to 


be independent of Xo. These observations combine to establish 
that the joint law of (Xo,Xi,X 2 ) has to factorize as in (1331) . 

Achievability: Consider a joint distribution Q(xo, Xi, x 2 ) 
that factorizes as Q(x o,Xi,x 2 ) = Po( x o)Px 2 {x 2 ) x 
Px 1 \x u x 2 ( x i\xo./x 2 ). Fix small e 2 > e\ > 0 and an arbitrary 
blocklength T. Then, pick a T-length sequence x 2 ) i,..., x 2i t 
that lies in the typical set TtP (Px 2 )\ see flLZl p. 25] for a 
definition of this typical set. 

The two agents produce the following actions. At stage 
f 6 (1,.Agent 2 produces x 2t . Agent 1 produces the 
random action X- [ t that it draws according to the conditional 

law Px 1 \x 0 x 2 ('\ x o,t, % 2 ,t)- 

We analyze the proposed strategies. Define the event: 

e(T) = {(*o T , Xl, 4) i T e f > (Q) }. (34) 

By the weak law of large numbers, and the conditional 
typicality lemma [Q21 p. 27], 


I(X 0 ; X 2 ) < /(X 0 , V; Y|X 2 ) - I(X 0 ;Y, y|X 2 ) 

= /(y ; y|x 2 )-/(y ; x 0 |x 2 ). (30) 

By ( l25l ). ( l26l >, and ( l30l >. we conclude that the joint law 
Y YPt=i Y , Xo,tX 1 , t x 2 ,t( x o, x u x 2 ) satisfies the conditions on 
implementable distributions that we stated in the theorem. In 
view of (l23l >. since e > 0 can be arbitrary small, and by con¬ 
tinuity of mutual information, then also the law Q( x o, xi, x 2 ) 
must satisfy these conditions. 


VI. Proof of Theorem[ 2 ] 

Converse. Let Q be an implementable distribution. Fix 
e > 0 and sufficiently large T. By definition, there must 
exist strategies such that for each t £ {1,... ,T}, the triple 
(X 0>t ,Xi |t ,X 2|t ) has a joint law Px 0 , t x ltt x 2it that satisfies 


_ Q( x o,xi,x 2 ) 

t =1 


<e. (31) 


Let Z be a random variable that is uniformly distributed over 
{1,...,T} and independent of Ajf. X(. X-f. Y r . Further, 
define X 0 = X 0) z, = Xi,z, X 2 = X 2 ,z, Y = Y z . 

Denoting the probability mass function of the triple 
(X 0 ,Xi,X 2 ) by Qx 0 x iX 2 , by the definitions above. 


Qx 0 ,x 1 ,x 2 {xo,xi,x 2 ) = — E P X 0 , t X 1 , t X 2 ,t {xq,Xi,X 2 ). 

t —1 

(32) 

We will prove that the law Qx 0 xiX 2 factorizes as 


Po(xo)Qx 2 (x 2 )Q Xi \x 2 ,x 0 (xi\x 2 ,x 0 ). (33) 

By (l32l >. by continuity, and by dTil . this will imply that also 
Q factorizes in this way, and thus conclude the proof. 

To prove (l33l >. we first notice that for any t £ {1,,T}, 
by the causality of the decoding, X 2i < depends only on F* -1 . 
By the causality of the encoding this latter is independent of 
X(j t. Thus, X 2 ^—o—Z—o— Xq z forms a Markov chain. Since 
Z and X() z = Xq are independent, X 2 = X 2 ,z also needs to 


lim P(E {t) ) = 0. (35) 

T-i-oo v ’ 

Since e 2 > 0 can be chosen arbitrarily small, by Proposition 5 
in 0, this establishes the desired achievability result. 

Appendix A 

Proof of Cardinality Bound 
Let us prove that in Theorem Q] it suffices to choose V of 
cardinality 

|V|<|*oM* 1 |.|X 2 |. (36) 

Let V denote the set of pmfs over X f) x X\ x X 2 . For each triple 
(xo, Xi, x 2 ) £ X 0 x X\ x X 2 except for one triple (xq, x*, X2) 
that one can freely choose, define the following continuous 
real-valued functions: 


9{x 0 , 2 : 1 , 2 : 2 ) :pefop(xo,x i,x 2 ). (37) 

Also define the continuous real-valued function go as on top 
of the next page, see (1381) . 

Now, fix a 5—uple (V,Xo,Xi,X 2 ,Y) satisfying the con¬ 
ditions in the theorem, and where V is allowed to be over 
any desired alphabet V which can even be of unbounded 
cardinality. Let Qx 0 x,x 2 denote the joint law of (Xo, A'i, X 2 ) 
and Fy(-) the cumulative distribution function of V. For each 
v £ V, let p x \v=v{'i v) G V denote the conditional law of 
the tuple (A'o, A'i,X 2 ) given V = v. 

For any tuple (xo,Xi,x 2 ) £ Xq x X\ x X 2 for which the 
function g^ x0lXllX2 ) i s defined, we have 

/ 9(x 0 ,x 1 ,x 2 )(Px|u=d)F 1 /(u) = Qx 0 x 1 x 2 ( x o,xi,x 2 ). (39) 

Jv 

Moreover, 

[ 9o(px\v=v)F v (v) = H(Xo,X 2 \V) - H(Y,X 2 \V) 

Jv 

= H{X 0 | V, A 2 ) - H(Y\V, A 2 ). (40) 

By the Support Lemma, m Appendix C], there exists a 
set V satisfying (l36l) . a probability mass function Qy{-) over 
V , and |V| conditional probability distributions {p v £ P} ve y 





g 0 : P £ V H> 


E 




'p( x 'o- x 'i- x 'i) log 


(x' 0 ,x' 2 )GX 0 X X-i Vx'jGAfi 



+ E 

{x' 2 ,y')ex 2 xy 


(iJ i ,i' 1 )6A’oX^i 


p{x' 0 ,x' 1 ,x' 2 )T(y'\x' 0 ,x' l ,x' 2 ) log 


p{x , 0 ,x' 1 ,x' 2 )T(y , \x' 0 , 


. (iJ,,iJ)6A’oX<Vi 



such that for any tuple (xo, * 1 , x 2 ) £ XqX X\X X 2 for which 
the function g( Xo ,x 1 ,x 2 ) I s defined, 

/ g(x 0 ,xi,x 2 ) (PX| V r =«)f'v r ( v ) 

Jv 

= ^2 9(x 0 ,xx,x 2 )ijPv)Qv{v) (41) 

v£V 

and 

J gn(pyL\v=v)F v ( v ) = '^29o{Pv)Qv(v). (42) 

v ^ev 

Define the 5—uple (V, Xq, Xi, X 2 , Y) to be of law 

Qv(v) ■ p v (x 0 ,x 1 ,x 2 )r(y\xo,x 1 ,x 2 ). (43) 

By definition (l43l i. by (l39l and by {TT}. the tuple 
(Xfj. X \, X 2 . Y) has the same law as the original tuple 
(X 0 ,X 1 ,X 2 ,Y): 

Pr [X 0 = xo, Xi = xi,X 2 = x 2 , Y = y] 

= Qx 0 ,. x 1 ,x 2 (xo,x 1 ,x 2 ) ■ T(y\x 0 ,x 1 ,x 2 ). (44) 

Moreover, by ( l40b and (l42l >. and Definition ( f38l ). the relevant 
mutual informations are also preserved: 

H(X 0 \V,X 2 )-H(Y\V,X 2 ) = H(X 0 \V,X 2 )-H(Y\V,X 2 ), 

(45) 

and as a consequence, by d44l> . 

I(Y ; V\X 2 ) - I(X 0 ; V\X 2 ) = I(Y ; V\X 2 ) - I(X 0 ; V\X 2 ). 

(46) 

This concludes the proof. 

Appendix B 
Proof of Theorem[ 3 ] 

A. Achievability 

Consider a joint distribution Quvx 0 x 1 x 2 y G A (U x V x 
Xq x Xi x X 2 x y) and a decoding function g:U xy —y X 2 
that satisfy Conditions (fT5l >— (fl7l) in the theorem. 

Fix small e > e > £3 > e 2 > > 0, and pick positive rates 

R, R in a way that we specify later on. 

Codebooks generation: Split the blocklength T into B 
blocks each of length n = \T/B\. For each block 

b e {1 ,..., B} randomly generate a codebook con¬ 
taining the n-length codewords {u( fe l(l), ..., 
and a codebook Cy' 1 containing the n -length codewords 
{u( b) (l,l),...y b l(|2 n - R J, L2”^J)}. All entries of all code¬ 
words of codebook cff are drawn i.i.d. according to the 


marginal distribution Qjj. Independent thereof, all entries of 
all codewords of codebook Cy'* are drawn i.i.d. according to 
the marginal distribution Qy. 

Encoding: Set i\ = js = 1. For each block b C {1,..., B}, 
let Xg b * denote the state sequence corresponding to block b. 

For each block b € {1,..., B }, the encoder (Agent 1) looks 
for an index e { 1 ,..., [ 2 "^]} such that 

(x^,uW(4)) eT^iQxou). (47) 

If there is more than one such index, it chooses the smallest 
among them, otherwise it declares an error. For b = 1 ,B— 
1 , set j b = i b+ i. 

For each block b £ {1,..., B}, the encoder looks for an 
index it € { 1 ,..., [ 2 n ^J} such that 

( x Q b ' > ,u( b *(ib),v( b *(jb,£b)') G Tj"*(Qx 0 uv)■ (48) 

If there is at least one such index, it picks one of them at 
random, otherwise it declares an error. The encoder finally pro¬ 
duces its F-th input of block b, Xi^t-i )n+t', by applying the 
conditional law Qx 1 \uvx 0 to the triple of symbols obtained 
by taking the P-th components of the codewords (it) and 
v( b \jb,£b) and the state vector Xg b *. 

Decoding: Let 4 = 1. Fix t G {1.... . 7’} and let b denote 
the block to which time t, belongs to, i.e., b = \t/n\. Decoding 
at time t £ {1,..., T} depends on output y t and on the index 
it that—as we will see in a moment—the decoder (Agent 2) 
produced in a previous decoding step. Specifically, the decoder 
produces x 2 .t by applying the decoding function g to the (t — 
(b — l)n)-th component of codeword u^ b \it) and to y t . 

If t is a multiple of n, i.e., we reached the end of a block, 
the decoder also looks for indices (jb,ib) G {l,..., [2 nR \ } x 
{l,..., |_ 2 ™^J } such that 

(uM(i b )M b) 0b,h),y {b) ) cT^(Quvy). (49) 

If there is at least one such index, pick one of them at random. 
Otherwise declare an error. Set 4+1 = jb- 

Analysis: We analyze the expected average distortion, where 
the expectation is taken with respect to the choice of the 
codebooks and the random realizations of the state and the 
channel. Define for each block b £ {1,..., B}, 

E b = {(x^ b) ,x[ b *,xi b *) i rS n) (Qx oXl x 2 )}, 


(50) 



and 


B 

E 2 :B = (J E b . (51) 

6=2 

We proceed to show that P(i? 2 :s) can be made arbitrarily 
small for n sufficiently large. We introduce the following 
events in each block b £ {1,..., B}. 


Whenever I Q (X 0 ;U) < Iq(V;Y,U) - Iq(V:X 0 ,U) = 
Iq(V;Y\U)-Iq(V;X 0 \U), and ei,e 2 ,e 3 > 0 are sufficiently 
small, it is possible to find rates R,R> 0 such that 

R> I Q (X 0 ;U)+6(e 1 ) (57a) 

R>I Q (V;Xo,U)+5{e 2 ) (57b) 

R + R<I Q (V;Y,U)-6(e 3 ), (57c) 


4 b) = { i T£\Qx 0 u)Vi € { 1 ,. ■ •, [2 nR \}} 

E[ b) ^ {(x^,U^(i b ),V^(j b ,£)) i T^HQxow) 

We {l,...,|2"*J}} 

e { 2 ] ±{(x$\uW(i b )M b Xjb,h),x?\YW) 

£ T^XQxoUVXiy)} 

E (») e T^XQuvy) 

for some j e {1,..., [2”^] }\{j b }, 

*e{l,...,|2 n *|}} 

4 b) = {(* t\uM(i b ),V( b) (j b ,eb),xi b \Y<- b \X? ) ') 


The probability P(E 2: b) may be upper bounded asQ 

p(e 2:B ) < Y [p(4 6) )+p(^ 6) i4 6)c ) 


6=1 L 


+ P(^ 6) |^ 6)C ) + P(E^ ) \E^ )C ) 


(b) | j?(b)c\ 


B 


+Y p ( E i b) \Er c ) 


(52) 


6=2 


Throughout this paragraph, 6(e) stands for a function that 
tends to 0 as e —>• 0 . 

• By the covering lemma fl2l . if R > Iq(X 0 ; U) + <5(ei), 
then for any b € {1,2,..., B}: 

lim E (p(eX > X'] = 0- (53) 

n—>• oo V v u 7 / 


• By the covering lemma, if R > Iq(V; Xq,U) + <5(e 2 ), 
then for any b £ {1,2,..., B}\ 

lim e(p(£ 1 (6) |£;® c )') =0. (54) 

n—>-oo \ v 7 / 

• By the conditional typicality lemma fT2l . for any b £ 
{1,2,..., B}: 

]imE(p(Ei b) \E[ b)e )) =0. (55) 

• By the packing lemma liTZl . if R + R < Iq(V; Y, U) — 
<S(e 3 ), then for any b £ {1,2 ,..., B }: 

lim e(p(£ 7? ) |£7? )c )'\ = 0. (56) 

n —>oo \ ' 7 / 


4 Here we also used the fact that event 


E^ )c nE^ )c implies i(,+i = *6+1- 


Thus, we conclude that P(E 2 -b) can be made arbitrarily 
small by choosing n sufficiently large and ei,e 2 ) e 3 > 0 
sufficiently small. Define now 

E 4 {(X 0 T ,Xf,X 2 T ) i TYKQxoX^)}- (58) 

Since e > e, by choosing B sufficiently large, the probability 
P (E) can be made as close to P(E 2] b) as one wishes. 
Thus, we conclude that when n, B are sufficiently large and 
ei, e 2 , e 3 > 0 are sufficiently small, it is possible to have 


P(£7) < e. 


(59) 


Assume now that ( l59l > holds. Under this assumption, we 
can bound the expected distortion (where the expectation is 
with respect to the choice of the codebooks and the channel 
realization) by 


K[-Y S ( X o,uX 2 , t ) 


t= 1 


= E(-^(XcM,* 2 , t )£)p(£) 


+E (^E^o,f,* 2 , t )|i^ P(E C ) 

< dm &x P(E) + (D + ed max )P(E c ) 

< D + 2 ed max , 


(60) 


where the first equality follows by the total law of ex¬ 
pectations; the first inequality because the distortion func¬ 
tion is bounded and by the definition of the typical set 
Te T) (Qx 0 x 1 x 2 )\ and the last inequality by ( l59l ) and because 
a probability cannot exceed 1 . 

We see that the expected average distortion—where the ex¬ 
pectation is taken with respect to the choice of the codebooks 
and the realizations of the state and the channel—can be made 
smaller than D + 2ed max when T is sufficiently large. As 
a consequence, there must be at least one realization of all 
codebooks such that the expected average distortion is no 
larger than D + 2ed max . Since e > 0 can be chosen arbitrarily 
close to 0 , this concludes the achievability proof. 

By continuity the above proof can be applied also when 
Iq(X o; U) < I Q (V ; Y\U)-Iq(V- X 0 \U) holds with equality. 


B. Converse 

Let D > 0 be an achievable distortion. Fix e > 0 
and T sufficiently large. By definition, there exist coding 
and decoding functions {<t£ ,c }^L 1 and {} 7 =i so that the 






sequences Xj. X'J\ Y 1 induced by these functions and by 
the channel (flOt . satisfy 


<D + e. 


(61) 


Let Z be a random variable that is uniformly distributed over 
{1,..., T} independent of Xjf, Xj. Xj. Y T , and define for 
each t. 


U t = (F 4 - 1 ) 

F = (* 0 T t+1 ). (62) 


Finally, let U = ( U Zl Z ), V = (V z ,Z), X 0 4 X 0 , z , 
Xi = X\ z, X 2 = X 2 ,z, and Y = Y z , and denote the 
probability distribution of the tuple (U,V, Xo, Xi, X 2 ,Y) by 
Quvx 0 x 1 x 2 y- By these definitions. 


and 


E(<5(X 0 ,X 2 )) = E 



^(X 0 ,t,X 2 , t ) 

t=1 


(63) 


= ^E J ( A o^ yT l A o T t +1 ) 

t= 1 

= y E [ J (*o,*; YT \ x o,t+i) + kx*-, x 0 T t+ i) 

t=i 

= i E [ J ( A o r t ; F‘) + /(X 0 , t ; y^ilF*^!) 

t=l 

= Y E [ 7 ( A o r t ; FtIF 4 - 1 ) + I{Xo,f, Y t T +1 |F% T t+1 )" 

t=l 
1 T 

= ^E [ J (^o,t,F;F t |L r t ) + /(X 0 , t ;F^ 1 |F t ,£FF) 

i=l 

= I{X 0i z, F z ; F z |f/ z , Z) + I(Xq z', Fj +1 |F Z , C/ z , F z , Z) 

= J(X 0> V; F|Z7) + J(X 0 ,z; Fj +1 |F Z , £/ Z) F z , Z). (67) 

Combining ( l66t and ( |67] >. we obtain 


Quvx 0 x 1 x 2 y(u, v, xo,xi,X2, y ) 

= Po(xo)Quvx 1 x 2 \x 0 (u, V, xi, x 2 |x 0 )r(y|x 0 , xi). (64) 

The Markov chain F — (Xo, X \) — (U, V, X 2 ) holds because 
by the i.i.d.-ness of the channel ®, F t - (X 0 , t ,X M ) - 
(Xq, Xf , Xj, F 4_1 ,F+i) forms a Markov chain for any 
t € {1,..., T}, and because the time-sharing random-variable 
Z is independent of (X(f , Xf, Xj, Y T ). 

In the following, we prove that 

I Q (X 0 ;U) < I Q (V;Y\U) - Iq(V; X 0 \U), (65) 

which combined with (f6Tb . (l63l >. and ( 1641 . by continuity, 
establishes the desired converse. 

To prove (l65l >. we first notice that on one hand, 

E(* 0 t ;F t ) 

= i^/(X°,t;F T |X 0 T t+1 ) 

t =1 

= i^/(X°,t;F T ,X 0 T t+1 ) 

t= 1 
1 T 

= ? E [^o.L^XoV) + 

t=l 

1 T 

= - E [/(^o,t; Ft, Ft, Ut) + I(X 0 ,f,Y t T +1 \Y u Ut, Vt )], 

t= 1 

= /(X 0 , z ; Y z , V z , U z \Z) + /(X 0jZ ; Fj +1 |F z , J7 z , F z , Z) 
= /(Xo; F, F, t/) + J(X 0>Z ; Fj +1 |F Z , C/ z , F z , Z). (66) 

On the other hand, we have that 


/ Q (X 0 ; Y, F U) < I Q (X o, F; F|E7), (68) 

which by chain rule of mutual information is equivalent to 

Iq(X 0 \U) < Iq{X o, F; Y\U) - J Q (X 0 ; F, F|C/) 

= /q(F;F|C/)-/q(F;X 0 |C/). (69) 
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